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In light of the finding of a previously unknown coronavirus as the aetiology of the severe acute 
respiratory syndrome (SARS), it is probable that other coronaviruses, than those recognized 

to date, are circulating in animal populations. Here, the results of a screening for coronavirus 

are presented, using a universal coronavirus RT-PCR, of the bird species graylag goose 

(Anser anser), feral pigeon (Columbia livia) and mallard (Anas platyrhynchos). Coronaviruses were 
found in cloacal swab samples from all the three bird species. In the graylag goose, 40 of 163 
sampled birds were coronavirus positive, whereas two of 100 sampled pigeons and one of five 
sampled mallards tested positive. The infected graylag geese showed lower body weights 
compared with virus-negative birds, suggesting clinical significance of the infection. Phylogenetic 
analyses performed on the replicase gene and nucleocapsid protein sequences, indicated that 
the novel coronaviruses described in the present study all branch off from group Ill coronaviruses. 


All the novel avian coronaviruses harboured the conserved s2m RNA structure in their 3’ 
untranslated region, like other previously described group Ill coronaviruses, and like the SARS 
coronavirus. Sequencing of the complete nucleocapsid gene and downstream regions of goose 
and pigeon coronaviruses, evidenced the presence of two additional open reading frames 

for the goose coronavirus with no sequence similarity to known proteins, but with predicted 
transmembrane domains for one of the encoded proteins, and one additional open reading frame 
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for the pigeon coronavirus, with a predicted transmembrane domain, downstream of the 


INTRODUCTION 


Coronaviruses are enveloped viruses with a positive- 
strand RNA genome (27-32 kb). They have been associated 
with disease in several warm-blooded animals, including 
humans, and in 2003, a previously unknown coronavirus 
was evidenced as the aetiological agent of the severe acute 
respiratory syndrome (SARS) that killed nearly 800 people 
from November 2002 to July 2003 (Drosten et al., 2003; 
Ksiazek et al., 2003; Kuiken et al., 2003). 


The coronaviruses comprise three serologically and geneti- 
cally distinct groups (Enjuanes et al., 2000; Gonzalez et al., 
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2003). Coronaviruses from group I and II have been found 
to infect several mammalian species, including humans, 
pigs, cows, dogs, horses, cats and rodents. Group III corona- 
viruses have been found to infect poultry, and include 
the chicken infectious bronchitis virus (IBV), the turkey 
coronavirus (TCoV) and the pheasant coronavirus. IBV 
causes an acute respiratory disease in chickens, while TCoV 
causes an acute enteric disease in turkeys. The poultry 
coronaviruses are antigenically similar and phylogene- 
tically related (Breslin et al., 1999; Cavanagh et al., 2001, 
2002). IBV has also been isolated from a peafowl and a 
teal, in a recent coronavirus screening of domestic birds 
(Liu et al., 2005), as well as from a flock of racing pigeons 
(Barr et al., 1988). Wild-bird species have been thought to 
be susceptible to coronavirus infection, as a coronavirus- 
like virus was found in ticks feeding on seabirds in Norway. 
However, this virus was not antigenically characterized 
further (Traavik et al., 1977). 
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The SARS coronavirus has been postulated to be an early 
split-off from the group II coronaviruses (Snijder et al., 
2003), while some authors have argued that there are evi- 
dences for recombination events in the evolution of this 
virus, involving both mammalian and avian coronaviruses 
(Rest & Mindell, 2003; Stanhope et al., 2004; Stavrinides 
& Guttman, 2004). Also, antigenic reactivity of the SARS 
coronavirus nucleocapsid with polyclonal antisera against 
group I coronaviruses, but not with polyclonal antisera 
against group II or group III coronaviruses, has recently 
been reported (Sun & Meng, 2004). 


An interesting feature of the SARS coronavirus genome is 
that it harbours a short motif (s2m) at its 3’ end, also found 
in group III coronaviruses, but not in other coronaviruses 
(Ksiazek et al., 2003; Marra et al., 2003; Wang et al., 2003). 
s2m is also present in all but one of the characterized 
astroviruses, and in one picornavirus (equine rhinitis virus 
B), and seems to be an element that is readily transferred 
between RNA virus genomes (Jonassen et al., 1998). The 
origin of the SARS coronavirus has still not been elucidated, 
but it has been proposed that the virus had an animal origin 
(Bell et al., 2004; Guan et al., 2003; Klempner & Shapiro, 
2004; Poon et al., 2004). The emergence of the SARS 
coronavirus has made it important to characterize further 
the coronavirus family to understand better which viruses 
are circulating in animal populations as potential zoonotic 
agents. 


In 2003, a Norwegian survey, originally aimed at investigat- 
ing the presence of enteropathogenic bacteria as well as 
influenza A virus and Newcastle disease virus in wild birds, 
was carried out. The sampling included fresh droppings, 
cloacal and tracheal swabs as well as a limited number of 
organs, from three bird species; the graylag geese (Anser 
anser), feral pigeons (Columbia livia) and mallards (Anas 
platyrhynchos). The same samples, as well as cloacal swabs 
sampled from graylag geese in August 2004, were investi- 
gated for the presence of coronaviruses, using RT-PCR. We 
report here the detection and characterization of novel 
coronaviruses infecting geese, pigeons and mallards. 


METHODS 


Droppings, swabs and organ sampling 


Graylag geese. A total of 119 fresh faecal samples from graylag 
geese were gathered in early April 2003 at eight different locations 
adjacent to lakes in the Klepp municipality (Rogaland county, 
south-western Norway). This is an important resting area for the 
graylag goose during its migration from wintering quarters in south- 
ern Europe to the nesting areas in Norway. The samples were kept 
at —70°C until analysis, and pooled six by six, into 20 pools, prior 
to analysis. 


Cloacal swab samples were collected from a total of 163 graylag geese 
shot during the hunting season on the islands of Smgla, More and 
Romsdal county, mid-Norway, (100 birds in early August 2003 and 
19 in early August 2004), and Vega, Nordland county, North-Norway 
(44 birds in early August 2004). Samples from the liver and kidneys 


were collected from a limited number of geese, and kept frozen until 
analysis. The sampling was usually carried out within 1 day after 
killing. The swabs were placed in tubes containing virus transport 
medium, and all samples were kept on ice until they reached the 
laboratory (2003) or they were kept frozen until analysis (2004). For 
all geese sampled, the age (juvenile or adult) was recorded, and for 
most of them, sex and body weight were recorded, together with some 
other physical parameters. 


Feral pigeons. One hundred pigeons (55 juveniles and 45 adults) 
were caught by net or cage traps at 11 different locations in Oslo, 
between late June and early September 2003. The birds were brought 
to the laboratory alive, euthanized and autopsied. Cloacal and tra- 
cheal swabs were taken from each bird and processed fresh, or kept 
at —70°C until analysis. Samples from several organs, including 
liver, lungs and spleen were taken from some of the birds and kept 
at —70°C. 


Mallards. Cloacal swabs were taken from five mallards (two juve- 
niles and three adults) caught by hand, in Oslo, on three occasions 
from early June to mid-October 2003. The swabs were placed in 
tubes containing virus transport medium, and the samples were kept 
on ice until they reached the laboratory, and processed fresh or kept 
at —70°C until analysis. 


RNA isolation and RT-PCR for coronavirus screening. RNA 
from swabs was isolated with QlAamp Viral RNA mini kit (Qiagen), 
and RNA from tissues was isolated with RNeasy mini kit (Qiagen) 
according to the manufacturer’s instructions. 


RT-PCR was performed using consensus coronavirus primers, 4Bm 
[5’-TCACA(C/T)TT(A/T)GGATA(A/G)TCCCA-3'] and 2Bp_ [5’- 
ACTCA(A/G)(A/T)T(A/G)AAT(C/T)TNAAATA(C/T)GC-3'], designed 
to amplify 250 bp of the replicase gene of all groups of coronaviruses 
(Stephensen et al., 1999). The samples were analysed with a two-step 
RT-PCR. cDNA synthesis was performed using SuperScript III RNase 
H”™ Reverse Transcriptase (Invitrogen) according to the manufac- 
turer’s protocol, with 5 pmole of the 4Bm primer. The RT reaction 
was performed in a thermocycler (MJ Research) at 55°C for 45 min, 
followed by an inactivation step at 70°C for 15 min. 


cDNA (5 pl) was added to a 50 pl PCR reaction and amplified by 
using HotStar Taq DNA polymerase (Qiagen). The primers used 
were 2Bp (3:75 M) and 4Bm (0-75 pM). The concentration of Mg’t 
in the reaction was 1-5 mM. 


The amplification programme consisted of an initial 15 min step at 
95°C, followed by 44 cycles with the following conditions: 2 cycles 
with 94°C for 60 s, 58°C for 25 s and 72°C for 50 s, 2 cycles with 
94°C for 60 s, 56°C for 25 s and 72°C for 50 s, and 40 cycles with 
94°C for 50 s, 55°C for 25 s and 72°C for 50 s. A final elongation 
step at 72°C for 5 min was performed, followed by chilling to 4°C. 


RNA isolated from a bovine faeces sample positive for coronavirus 
was used as a positive control in all RT-PCR set-ups, while negative 
controls consisted of RNase/DNase-free water. 


As sequence information became available for the degenerated 
positions in the 2Bp and 4Bm primers, new primers were designed 
and used further in the screening, using the same conditions for the 
RT and PCR. The new primer sequences were 4Bm-mod: 5’- 
TCACATTT(A/T)GGATAGTCCCA-3’ and 2Bp-mod: 5’-ACTCAG- 
(A/T) TGAATTTGAAATA(C/T)GC-3’. On some occasions, unspecific 
PCR products were observed, but only PCR products that could be 
verified by sequencing were considered positive. To increase further 
the specificity, another primer, 4Bm-7gaas [5’-T(A/T)GGATAGT- 
CCCAGCCCATA-3’], partly overlapping 4Bm-mod, was used instead 
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of 4Bm-mod in the cycle sequencing reaction, as well as in the PCR 
step of the RT-PCR performed on different tissues in the virus 
propagation experiments. 


Virus propagation. Faecal dropping and swab samples were 
routinely passaged twice in 9-day-old embryonated chicken eggs, 
inoculated in the allantoic cavity. Mortality was recorded, and a 
haemagglutination test was performed on all the samples after both 
passages. Non-inoculated embryonated eggs were used as controls. 


After detection of coronavirus RNA in a large part of the geese sam- 
ples, new inoculations were performed in a total of 26 embryonated 
chicken eggs, in several experiments. Inoculations were performed in 
the allantoic cavity of 16 9-10-day-old embryonated eggs, and in the 
amniotic cavity of three 15-day-old embryonated eggs (Adams & 
Hifstad, 1971; Jordan & Nassar, 1973). Several organs were sampled 
from the infected birds for coronavirus RT-PCR, including intestine, 
lungs, trachea, liver, kidneys, brain and spleen from the amniotic 
cavity inoculated eggs, and including the corioallantoic membrane 
(CAM) as well, from most eggs inoculated in the allantoic cavity. Organ 
sampling was performed 5 days post-infection for the eggs inocu- 
lated in the amniotic cavity and both 5 and 10 days post-infection 
for the eggs inoculated in the allantoic cavity. A second passage was 
performed later on, in the allantoic cavity of three 10-day-old eggs 
using kidney homogenates from the first passage as the source, and a 
new inoculation with geese cloacal swabs on four 10-day-old eggs was 
performed as well. RT-PCR for coronavirus detection and observa- 
tion of histological changes were performed on kidneys, liver and 
brain of these birds, 10 days post-infection. 


Similarly, organ samples positive for coronavirus from a pigeon 
were used to inoculate the allantoic cavity of a total of ten 10-day-old 
embryonated eggs. Several organs were collected from the embryos 
after 5 or 10 days of incubation, and tested for the presence of 
coronavirus by RT-PCR. Kidneys, spleen and liver were included in 
all cases, as well as intestine, trachea, CAM and lungs, from the chicken. 


Amplification of the 3’ end and 5’ RACE for sequencing. All 
other coronaviruses known to infect birds possess s2m in their 3'- 
untranslated regions (UTR). RT-PCR was performed using a primer 
designed in s2m [5’-CCGAGTA(C/G)GATCGAGGG-3'] as a sense 
primer and oligo(dT) s as the reverse primer, producing a PCR pro- 
duct for sequencing. For all novel coronaviruses described here, elec- 
trophoresis of the PCR products generated a smear, consistent with 
the presence of s2m, and the products were sequenced using the 
s2m primer, giving sequence information between s2m and the 
poly(A) tail. The sequence information allowed the design of two 
primers, coronaracel (5’-TTTTTTTTTTTGCTCTAACTCTAT-3’) 
and coronarace2 (5’-TTGCTCTAACTCTATACTAGCCTA-3’), which 
were used as a primer in RT and an antisense primer in PCR, 
respectively, followed by sequencing, using a 5’ RACE strategy. 
Several specific sequencing primers were then designed and used 
for primer walking, as described previously (Jonassen et al., 1998). 
When sequence information spanning the nucleocapsid gene had 
been obtained for one goose and one pigeon coronavirus isolate, 
PCR products for sequencing of other isolates were obtained using 
the nucleocapsid sense primers GC7 (5’-TGCAGTCACGTCTG- 
GAGAG-3’) for goose coronavirus, and DC5 (5’-CAGCACCAGT- 
CCTAAAACTT-3’) for pigeon coronavirus, towards the coronarace2 
antisense primer. 


Sequencing and sequence analyses. Sequencing was performed 
on purified PCR products, using the ABI PRISM BigDye Terminator 
Cycle Sequencing Ready Reaction kit v3.1 (Applied Biosystems) 
according to manufacturer’s instructions, and analysed on an ABI 
PRISM 3100-Avant Genetic Analyser (Applied Biosystems). 


Software used for sequence analysis were Sequencher version 4.1.4 


(Gene Codes Corporation; http://www.genecodes.com), FASTA simi- 
larity search and CLUSTALW Multiple Sequence Alignment Program 
(http://www.ebi.ac.uk/), BioEdit Sequence Alignment Editor version 
5.0.9 (Tom Hall, Department of Microbiology, North Carolina State 
University, North Carolina, USA; http://www.mbio.ncsu.edu/BioEdit/ 
bioedit.html), PUZZLE 4.02  (ftp://ftp.ebi.ac.uk/pub/software/dos/ 
puzzle/), PHYLIP package version 3.6 (Joe Felsenstein, Department of 
Genome Sciences, University of Washington, Seattle, Washington, 
USA; http://evolution.gs.washington.edu/phylip.html) and TREEVIEW 
(Win32) version 1.6.6 (Roderick D. M. Page, Division of Environ- 
mental and Evolutionary Institute of Biomedical and Life Sciences, 
University of Glasgow, Glasgow, UK; http://taxonomy.zoology.gla. 
ac.uk/rod/treeview.html). ORF analyses and protein domain predic- 
tions were performed using JustBio (http://www.justbio.com), PSORTII 
prediction (http://psort.nibb.ac.jp/form2.html), TMHMM _ server 
version 2 (http://www.cbs.dtu.dk/services/TMHMM/) and HMMTOP 
(http://www.enzim.hu/hmmtop/index.html). 


Data processing and statistical analyses. The statistical ana- 
lyses of the influence of the different physical parameters on body 
weight were performed in SAS-PC 8.3 for Windows with enterprise 
Guide (SAS Institute). 


The values of the weight of the birds (gram) were normally distri- 
buted, and therefore the association to the various independent 
variables was assessed in a multiple general least square model based 
regression analysis. The independent variables included in the analy- 
sis were the categorical variables; population (referring to the different 
islands and years, i.e. Smola 2003, Smola 2004 and Vega 2004), sex, 
age (juvenile/adult) and coronavirus detection (negative/positive). Sex 
was not recorded for 31 geese, and only the 132 birds with registra- 
tions for all variables were included in the analysis. The elimination 
criterion for variables in the model was the type-III F-test, a P-value 
of 0-05 being used as the level for exclusion. All variables were below 
this level, and they were all included. 


RESULTS 


Coronavirus screening with RT-PCR 


Only samples in which the presence of coronavirus could 
be confirmed by sequencing were considered positive. The 
findings are summarized in Table 1. 


Of the 20 pooled goose dropping samples collected in 
April 2003, three were found positive for coronavirus, and 
of the 163 cloacal swabs collected from graylag geese in 
August 2003 and August 2004, 40 (25%) were found 
positive for coronavirus. From four of the virus-positive 
geese (three adults and one juvenile), organ samples were 
available, and three of them (one juvenile, two adults) tested 
positive for coronavirus in kidney samples, while all four 
liver samples were negative. The characteristic of being 
virus positive contributed significantly to the variation in 
body weight of the geese (P<0-001), with an estimated 
weight difference of 177 grams (Table 2). 


Cloacal swabs from two adult pigeons were found to be 
positive for coronavirus. One of them also tested positive 
from the tracheal swab sample, while all other tracheal 
swab samples examined were negative. Organ samples 
were available only from one of the two positive pigeons, 
the one that was positive in both tracheal and cloacal 
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Table 1. Results of testing for coronavirus by RT-PCR on samples from graylag geese, feral pigeons and mallards in Norway, 


2003-2004 


Juv., Juveniles; Ad., adults. 


Bird species Sample material Place and year Age No. positive/analysed Positive rate (%) 
of sampling samples 
Graylag geese Droppings 6-sample pools Klepp 2003 - 3/20 - 
Cloacal swabs (n= 163) Smola 2003 Juv. 4/31 12:9 
Ad. 12/69 17-4 
Smola 2004 Juv. 4/6 66°6 
Ad. 2/13 15°4 
Vega 2004 Juv. 11/26 42:3 
Ad. 7/18 38-9 
Feral pigeons (n= 100) Cloacal swabs Oslo 2003 Juv. 0/55 0-0 
Ad. 2/45 4-4 
Tracheal swabs Juv. 0/55 0:0 
Ad. 1/45 2:2 
Mallards Cloacal swabs Oslo 2003 Juv. 0/2 0:0 
Ad. 1/3 33°3 


swabs. RT-PCR evidenced coronavirus in the liver and 
spleen, but not in the lungs of this bird. Both of the virus- 
positive pigeons were in a normal body condition, and no 
pathological lesions were found at necropsy. 


One of the mallard cloacal swab samples was positive for 
coronavirus by RT-PCR. This sample was also positive for 
low-pathogen influenza A virus (H3N8). 


Virus propagation 


All samples screened, except the one containing influenza 
A virus, were negative in the haemagglutination test. None 
of the attempts to grow goose or pigeon coronaviruses in 
embryonated chicken eggs was successful, as assessed by 
RT-PCR on sampled organs. Besides, there were no specific 
histological findings on organs sampled, and mortality 


was reported in only one of the eggs, inoculated with 
goose coronavirus, 10 days post-infection. No morphological 
changes, like dwarfing, were observed for any of the embryos. 


Prior to coronavirus screening, very little material was left 
of the original cloacal sample from the positive mallard, due 
to the influenza virus investigations. Therefore, no further 
attempts to grow duck coronavirus in embryonated eggs 
were initiated. Still, coronavirus RT-PCR was negative, 
when performed on allantoic fluid samples collected from 
the dead eggs inoculated in the allantoic cavity when 
investigated for influenza virus. 


Sequence analyses 


Fig. 1(a) shows the phylogenetic analysis of the corona- 
virus family, including the novel bird coronaviruses, on 


Table 2. The least square means estimates from the final multiple least square regression 
model of the individual weights of 132 graylag geese at the time of sampling in 2003 and 


2004, in relation to the coronavirus findings 


The final model explains 56-7 % of the observed variation in the bird weights. 


Type 3 P-value 
for variable (F-value) 


Independent variable Category (n) Least square means 
Population Smola 2003 (69) 2728 
Smola 2004 (19) 2706 
Vega 2004 (44) 2890 
Age Adult (85) 2944 
Juvenile (47) 2605 
Sex Female (52) 2606 
Male (80) 2943 
Virus detection by RT-PCR Negative (97) 2863 
Positive (35) 2686 


<0:01 


<0-001 


<0-001 


<0-001 
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911 
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1041 
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the part of the replicase gene that was sequenced in this 
study, and Fig. 1(b) illustrates the variation in the goose 
coronavirus isolates in that same gene part. 


The nucleocapsid genes from both pigeon and goose 
coronaviruses were sequenced, as well as the rest of the 3’ 
end of the genomes, while only a short part of the 3’ end 
was sequenced from the duck coronavirus, due to the 
limited quantity of the starting material. Fig. 1(c) shows a 
phylogenetic tree of the coronavirus family based on the 
nucleocapsid protein, and Fig. 2 shows the nucleotide 
alignment of the avian coronaviruses downstream of the 
nucleocapsid gene. 


The goose coronavirus was found to have two additional 
open reading frames (ORF) between the nucleocapsid 
gene and the 3’-UTR, named ORFxg and ORFyg, poten- 
tially encoding two proteins, 95 and 169-176 aa long, 
respectively, with no sequence homology to any other 
protein in GenBank. Several goose coronavirus isolates 
were sequenced in the 3’ region, all displaying the two 
additional ORFs, even if nucleic acid heterogeneity, 
including insertions and deletions, was observed, espe- 
cially in ORFyg. Amino acid sequence analysis predicted 
two transmembrane domains in the protein encoded by 
ORFyg for all analysed goose coronavirus samples 
(Fig. 3). No characteristic protein domain was predicted 
for ORFxg. 


Similarly, downstream of the nucleocapsid gene, the pigeon 
coronavirus was found to have one additional ORF, named 
ORFyp, potentially encoding a 63 aa long protein with a 
predicted transmembrane domain (Fig. 3). ORFyp showed 
no sequence homology to any other proteins in GenBank, 
nor to the additional ORFs of the goose coronavirus, at 
the amino acid level, but showed sequence similarity at 
the nucleic acid level with the 3’ part of ORFyg, and the 
5-terminal part of the 3’-UTR of some IBV and TCoV, as 
shown in Fig. 2. 


DISCUSSION 


Coronaviruses could be detected by RT-PCR in all the three 
avian species that were included in this survey. Although 
virus could not be isolated in embryonated chicken eggs 
by methods used in this study, sequence homology analysis 
of the sequenced PCR products obtained both in the rep- 
licase region and in the 3’ part of the viral genome, indicates 


that virus is present in these animals. Furthermore, the 
present coronavirus screening showed that RT-PCR based 
on primers aimed at detection of members of the different 
groups of coronaviruses (Stephensen et al., 1999) could 
be used for detection of novel coronaviruses. Coronavirus 
infection seemed to be widespread in the graylag goose 
populations sampled, as every fourth goose harboured 
coronavirus. The prevalence seemed to vary between 
geographical locations, years and age groups, and was 
higher in 2004 (38 %) than in 2003 (16%). This could be 
explained by different phases in the infection dynamics of 
the different populations at the time of sampling. The 
observed variation in the sequences of the goose corona- 
virus replicase gene further suggests that the virus is actively 
circulating in the geese. As juvenile and adult birds were 
found to be infected in approximately equal proportions, 
the virus does not seem to be the cause of a disease 
particularly affecting juveniles. 


The result of the regression analysis demonstrated that 
being virus positive had a significant influence on body 
weight, indicating that the infection has a clinical signi- 
ficance for the birds. From the tissue samples collected, virus 
could be detected in the kidneys of several geese, similarly 
to some IBV strains that have been shown to display a 
tropism toward kidneys (Albassam et al., 1986). As no 
tracheal swabs were sampled from the geese, the respiratory 
route of transmission was not investigated. Whether corona- 
virus infection in geese is predominantly an enteric, renal, 
respiratory or systemic disease therefore remains to be 
clarified. However, the large proportion of positive cloacal 
swab samples suggests that the faecal—-oral transmission 
route is important, with the intestinal tract of the geese as 
a potential primary target organ for coronavirus replica- 
tion. The presence of virus both in fresh droppings collected 
upon the arrival of the geese to Norway in springtime, and 
during the autumn hunting, suggests that a great propor- 
tion of the birds carry coronavirus throughout large parts 
of the year. 


One of the pooled goose droppings sample (03/436 7-12) 
displayed several ‘wobbled’ nucleotides suggesting it could 
be a mix of goose and duck coronaviruses, even if care was 
taken to pick up only freshly laid faecal samples upon 
gathering. The area where the droppings were gathered 
hosts several migrating bird species in the springtime, and 
some duck faeces might have been picked up together with 
goose faeces. Further investigations are, however, needed 


Fig. 2. Nucleotide alignment of the 3’ part of avian coronavirus genomes, showing two isolates of goose coronavirus (GO3/ 
586-50 and G03/586-77), limited sequence data of the duck coronavirus (D03/1094), and one pigeon coronavirus isolate 
(P03/653), compared with previously published IBV (Vic S, KB8523, N1/62, Beaudette, N1/88) and TCoV strains (Minnesota 
and NC95). The alignment starts 2 nt upstream of the stop codon of the nucleocapsid gene, and ends immediately upstream 
of the poly(A) tail. Dots indicate nucleotides identical to the GO3/586-50 sequence, gaps are shown as dashes, and missing 
data are shown blank. Predicted ORFs are underlined, with double lines under start and stop codons. Start of ORFs are also 
indicated by < above the sequences. For goose and pigeon coronaviruses, the putative TRS are shown in white on a grey 


background. s2m is shown as a shaded box. 
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GCoV 03/586-77 ORF yg 
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) 100 200 
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Fig. 3. Probability of transmembrane helices (shaded curves) 
and non-cytoplasmic localization (open curves), as predicted by 
TMHMM, for the proteins encoded by the ORFyg of two goose 
coronavirus (GCoV) isolates, the ORFyp of one pigeon corona- 
virus (PCoV) isolate and for the ORFs downstream of the 
nucleocapsid gene predicted in some IBV strains. Amino acid 
numbering is shown on the horizontal axis and probability on 
the vertical axis. 


to shed light on how widespread coronavirus infection is 
in mallards, as only five individuals were sampled. 


Coronavirus infecting pigeons was found in only two of 
100 analysed cloacal swabs, indicating a lower prevalence 
or shorter duration of the infection, as compared with 
graylag goose. Virus was detected in the liver and spleen 
of one of the swab-positive pigeons, suggesting a viraemic 
stage of infection in this species. Virus was detected in the 
tracheal swab sample but not in the lungs of the pigeon, 
leaving the mode of transmission of pigeon coronavirus 
uncertain. Necropsy did not reveal enlarged spleen, or 
other significant pathological findings in the coronavirus 
infected pigeons. 


Most IBV strains, as well as pheasant coronavirus, have 
been propagated in embryonated chicken eggs by inocula- 
tion of the allantoic cavity (Gough et al., 1996; Jordan & 
Nassar, 1973), while TCoV has been shown to replicate in 
embryonated chicken eggs by inoculation of the amniotic 
cavity (Adams & Hifstad, 1971). Propagation of the goose 
coronavirus was attempted by both allantoic cavity and 
amniotic cavity inoculation of embryonated chicken eggs, 
to include the enteric route of transmission, while propa- 
gation of the pigeon coronavirus was attempted by allantoic 
cavity inoculation alone. No replication in embryonated 
chicken eggs could be achieved for the pigeon or goose 
coronavirus in the present study, suggesting that these 
coronaviruses do not readily infect chicken embryo. Further 
attempts will be made to isolate these viruses either in 
embryonated eggs from goose or in cell culture. 


Phylogenetic analyses performed both in the replicase 
gene, and on the nucleocapsid protein, indicate that all 
the coronaviruses found in this study are novel, as opposed 
to the newly reported IBV infections in a domestic peafowl 
and in a teal from domestic bird flocks (Liu et al., 2005). IBV 
infection has also been reported in a flock of racing 
pigeons in Australia, and confirmed by serological tests 
to be of the same serotype as the widely used Australian 
IBV vaccine strains, but no sequence information is avail- 
able for this isolate (Barr et al., 1988). All the novel bird 
coronaviruses branch off from group III coronaviruses, 
indicating a monophyletic origin with other bird corona- 
viruses. Several IBV strains isolated in Australia are 
phylogenetically distant from most other TCoV and IBV 
strains, in the spike and nucleocapsid proteins (Sapats et al., 
1996a, b), and even if all the IBV/TCoV strains clustered 
together in a monophyletic group as opposed to goose and 
pigeon coronaviruses in the nucleocapsid protein, this 
branching is not strongly supported by bootstrap analysis 
(Fig. Ic). 


One of the unique features of the goose coronavirus, as 
compared with other bird coronaviruses is the presence of 
two additional ORFs at the 3’ end of its genome, with no 
amino acid sequence homology to other known proteins. 
While no specific domains were predicted for the pro- 
tein encoded by ORFxg, two transmembrane domains 
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were predicted for ORFyg. However, the putative RNA 
transcript of ORFyg harbours two very closely located 
methionine codons in different frames. The first methio- 
nine is the start of the longer frame, ORFyg, while the 
second is the start of a shorter ORF encoding a short 
protein (45 aa). The second methionine codon is, how- 
ever, in a better context of translation start in some of 
the goose coronavirus isolates, according to the rules of 
translation initiation proposed by Kozak (1995, 1997), with 
a G as the fourth base, immediately downstream of the 
ATG, and it cannot be ruled out that a protein encoding 
the shorter ORF is also synthesized. The protein encoded 
by this short ORF contains a transmembrane domain as 
well (data not shown). The group I coronaviruses trans- 
missible gastroenteritis virus (TGEV) and feline corona- 
virus (FCoV) have also been shown to encode one and two 
proteins downstream of the nucleocapsid gene, respec- 
tively (Herrewegh et al., 1995; Tung et al., 1992). The addi- 
tional ORF of TGEV and one of the additional ORFs of 
FCoV also encode proteins with transmembrane domains. 
However, in FCoV, as opposed to goose coronavirus, it is 
the most upstream additional ORF that encodes a protein 
with transmembrane domains. 


The short additional ORF found downstream of the 
nucleocapsid gene of the pigeon coronavirus is similar in 
length to the predicted ORF located in the hypervari- 
able region (HVR) of the 3’-UTR in some strains of IBV 
(Breslin et al., 1999; Sapats et al. 1996a; Williams et al., 
1993). Both goose and pigeon coronaviruses have a 
sequence (CTTAACAA) upstream of their nucleocapsid 
gene, similar to the transcription regulatory sequence 
(TRS) of IBV and TCoV [CT(T/G)AACAA]. A similar 
putative TRS (TTTAACAA) was found, upstream of both 
additional ORFs of the goose coronavirus, and of the 
single additional ORF of the pigeon coronavirus (Fig. 2), 
suggesting that mRNA specific for these ORFs are 
synthesized. 


The pigeon coronavirus ORFyp and the predicted short 
ORFs of some IBV strains, all display some nucleotide 
sequence similarity with the 3’ part of the goose coronavirus 
ORFyg, suggesting a possible homologous origin of these 
ORFs in bird coronaviruses (Fig. 2). The short predicted 
ORFs of IBVs, ORFyp and ORFyg all potentially encode 
proteins with transmembrane domains (Fig. 3), predicted 
by PSORTI to have a membrane topology of the 3a type 
(Hartmann et al., 1989; Singer, 1990), which locate toward 
the endoplasmic reticulum. This could suggest a similar 
function of these proteins to the short protein synthesized 
from the additional ORF at the 3’ end of the TGEV genome, 
that has been found to be associated with the endo- 
plasmic reticulum and cell surface membranes of infected 
cells (Tung et al., 1992). However, in contrast to the goose 
coronavirus ORFyg and pigeon coronavirus ORFyp, the 
additional ORF in some IBV strains, is not preceded by 
the TRS specific for IBV, suggesting that no mRNA specific 
for this ORF is synthesized. Even if internal ribosome 


entry has been suggested for the translation of some of 
the IBV proteins (Liu & Inglis, 1992), the fact that this 
ORF is mostly located in a part of the genome that is 
highly variable in IBV and TCoV, with loss of coding 
potential, or even largely deleted in some strains, indicates 
that it is not subject to strong conservative evolutionary 
constraints. Besides, the HVR has been shown experimen- 
tally not to be involved in virus replication and packaging, 
nor to be a part of the 3’-UTR that binds to nucleocapsid 
(Dalton et al., 2001; Zhou & Collisson, 2000). This could 
suggest that the HVR in IBV and TCoV is a truncated, 
presumably non-functional, remnant of a longer gene of a 
common ancestor of bird coronaviruses that is still pre- 
sent in the goose coronavirus, encoding two potential 
proteins. The pigeon coronavirus also harbours a truncated 
gene version, but the TRS upstream of it suggests that a 
specific mRNA is synthesized and potentially translated 
into a functional protein. Further studies are required to 
verify the presence and the function in virus replication of 
the additional proteins in goose and pigeon coronaviruses. 
The part of the 3’ end sequence obtained from the duck 
coronavirus was too short to assess the presence of an 
entire additional ORF, with a TRS and a start codon, but 
the extensive sequence similarity, both in nucleotide and 
predicted amino acid sequence, with goose coronavirus in 
the 3’ terminal part of ORFyg (Fig. 2), suggests that duck 
coronavirus also encodes a membrane-associated protein 
at its 3’ end. 


The sequences encoding the nucleocapsid gene in both 
pigeon and goose coronaviruses contain some internal 
ORFs, 40-70 codons long. No function was predicted for 
the proteins encoded by these ORFs, except for a nuclear 
localizing signal (NLS) in one of the goose and one of the 
pigeon coronavirus ORFs (Hicks & Raikhel, 1995; Robbins 
et al., 1991). Some other coronaviruses have been shown to 
synthesize proteins from internal ORFs in the nucleocap- 
sid genes (Fischer et al., 1997; Singh, 1999), with no known 
function. Two internal ORFs were reported within the 
nucleocapsid gene of the SARS coronavirus, as well (Marra 
et al., 2003), and it cannot be ruled out that some additional 
proteins are synthesized in the pigeon and goose corona- 
viruses from the internal ORFs. Besides, the nucleocapsid 
gene seemed to be much more conserved than the part of 
the replicase gene in the five goose coronavirus isolates that 
were sequenced in both regions (data not shown), sug- 
gesting some evolutionary constraints in the nucleocapsid 
gene compatible with the presence of translated overlap- 
ping reading frames. 


The nucleocapsid protein of several coronaviruses, includ- 
ing IBV, has been reported to be localized in the nucleolus 
(Hiscox et al., 2001; Ren et al., 2004; Wurm et al., 2001). 
The nucleolus is the site of ribosome biogenesis, and is 
involved in cell growth regulation. Cells expressing IBV-N 
protein have been shown to have retarded cell growth, 
delaying the cell cycle at interphase where maximum trans- 
lation of viral mRNAs can occur (Chen et al., 2002). The 
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nucleocapsid proteins of both pigeon and goose corona- 
viruses contained several NLSs (Hicks & Raikhel, 1995; 
Robbins et al., 1991), suggesting that their nucleocapsid 
proteins locate toward the nucleus as well. 


In conclusion, the present identification of novel corona- 
viruses indicates that coronaviruses also infect wild-bird 
species, and confirms the grouping of all known avian 
coronaviruses in group IIL. The findings suggest a mono- 
phyletic origin of the avian coronaviruses, as compared with 
the coronaviruses infecting mammals, and thus could not 
confirm a bird coronavirus involvement in the history of 
the SARS coronavirus. The relationship of the SARS 
coronavirus to group III coronaviruses was first postu- 
lated, since the presence of s2m at the 3’ end of the SARS 
coronavirus genome was one of the first evidenced 
sequences in the original search for the aetiological cause 
of SARS (Ksiazek et al., 2003; Wang et al., 2003). s2m was 
found in all of the novel bird coronaviruses, and seems to 
be a common feature of all group II] coronaviruses. The 
SARS coronavirus could have acquired s2m in an event 
independent of its presence in the bird coronaviruses, as 
s2m seems to be readily transferred between viral RNA 
genomes. Besides, the 3’-UTR of the SARS coronavirus 
has been found to replace functionally its counterpart in 
the group II mouse hepatitis virus (Goebel et al., 2004). s2m 
has recently been shown to have a rigorously conserved 
three-dimensional structure (Robertson et al., 2005), and 
further studies are required to shed light on its function. 
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